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NXT Heparin Affinity Glycgtad. 
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FIGURE 9 



1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGG A 
20 1 TCCAGAAAGT TATTAGATGA AGAAATC AAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
35 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
40 1 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTC A 
45 1 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
55 1 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
65 1 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATC ACTTTC TCAACAAC AT CAACGTGCCG AATTGG AATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
105 1 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTCACACA A TTTATTAATG AATTTTTCTA 
1 20 1 CAATATACTC CCCAAAGACT TTTTGACTGG A AG AAACCGT AAGAATTTTC 
125 1 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1 301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1 35 1 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
140 1 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
1 45 1 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1 50 1 AACCTATTAC TACAG AAAG A ATATTTGGG A CGTCATTATG AAAATGTCAA 
1 55 1 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCA ATTAT G ACTTTCAAT AAGAAG ATTG 
1 70 1 TAAATTCAGA CCGG AAGACT ACA AA ATTAA CTACAAATAC GAAGTTATTG 
1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1 80 1 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1 95 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGG ATTATGA 
200 1 CTGCACAAAT TCTAAAGAGA AAG AATAAC A TAGTTATCGA TTCGAAAAAC 
205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
215 1 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
220 1 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
225 1 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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235 1 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 

2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 

2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

265 1 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 

2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 

2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 

2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 

2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 

2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 

295 1 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

K 3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 

O 3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 

□ 3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 

3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 

3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 

rtl 3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 



FIGURE 10 

1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RMQSQSHYKD 
51 LEDDQFAQT NIVATPRDYN EEDFKV1ARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
1 5 1 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRM 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKJ ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKJCY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWTFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKN1WDV1M KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
75 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
85 1 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



FIGURE 11 



1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTC AT TG ACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
25 1 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
65 1 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
90 1 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1 05 1 AAAGAAG AAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1 101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1 25 1 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAG AAATGG A GCCG A AATCT TA ATC A AA AA -G AATTGCGTC G ATATTGCAA 
1401 AAGAATCGAA CTCTA AATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1 551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1 60 1 ATACAAACCT TGGTC AAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGGGG 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 * * ♦ 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGG AT A'rTTr'rr i*C TTTl^ M r AACTCC ATC AAATCT 

a PQNPXTPKPL * KKKKLR'* FR 

b PKTPKPQNPYKKR KNCGSLE- 

C PKPONPKTPIKKEKIE.VV-K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 * •* 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTA 
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T ACTTCCT AAT AC ATTC AAC AAGT AT AGC AG CTCTTG T AGTGAC AAG AAAGG ATGC AAAA 
ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTAC 



180 



\% a YFLIHSTSIAALVVTRKDAK 
H' b TS'YIQQV^QLL-'OERMQN 
ZiPNTFMKYSSSCS DKXGCKT 
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CATTGAAATCTGCCTCGAAATCGCCTTCATTGACTATTCCAAAGTT^ 
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GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

HCNLARNRLHCLFQ SCKNN 
IEIWLEIAF I DYSKVAKTIR- 
LKSGSKSPSLTIPKLQKQLK- 
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AGTTCTACTTCTCGGATGCAAATC'rTTATAACGATTCTTTCTTGAGAA 
Q TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

Ula SSTSRMOIFITILSCEN'F- 
b VLLLCCXSL * RFFLEK ISFK 

C PYPSDANLYNDSTLRKLVLK 
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AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 
TTTCGCCTCTCGTTTCTC ATCTTT AACTTTGTAATG ATT AC AAATTT ATTTT AGTCC ATT 
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TG AGG ATTATTCTATTTTTT AG AT C ACTTCTT AAGG AGC ATT ATGG AG AAAATT ACTT AA 

" " - 42 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATCAGTATATTAAATT 
ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 
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CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 ♦ * 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

HMRMSQRISIHQTYQRQTRY 

b ICECVKGSRY I RLTKDKLA I 

c YENESKDLDTSDLPKTNSL 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGaAGaACTTATTGCATTTACTATTCG 

541 * * * * 600 

TTTTGCCTTCTTTTTCAAACTATTACCTTGTCCTCT^ 

a KTQEKVC'SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR- 

c NARKSLI IEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 - * ♦ * * 660 

ATACCCAAAATAAT<n^AACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRY R RCTPESCDN-- 

b MGFITIVLGI OGELPSLETI 

C WVLLQLF * VSTVN S RVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAJUVGTTCTC 

661 - - - - 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCaAGACTTTCAAGACTACACATACGGTA 
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\ If TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

if% 721 * ♦ * ♦ 780 

'% ATAAAACACTTAATTAGACTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

m 

<5 . a YFVN'SQISYLNLMDSYRNK 

1, b ILCENLKYLI SI *WIAIETN- 

IJ. c FCELISNILSOFNG-L'KQT- 

^ CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 - - 840 

\ : , ih GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGCAAACCCTGTTTACGTGTG 

■r| a PNKPCKrNG I YVKSFGTNAH 

b 0 I N H A S I M E Y T L N P L G Q M H T - 

Hi e K-TXQV'WNIR-ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCA7ACATACACAGAATGCTTTAGAGACTGATTTAGC 

841 - -- - - 900 

ACT? AAATA7 AACC T AAG AATTTCCT ATCT ATGTGTCTT ACGAAATCTCTGACT AAATCG 

a C IYIGFL.XHRYTECFRDCFS 

b EFt LDS * S IDTQNALETO'LA 

C N L Y * I L K A - I H R M L • R L I • L - 

TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 - - - -* * 960 

AATGTTCTCT^TGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL*KKQ- 

c TTO YLFCLLLLi SY I FKRSR- 

GGCGAAATCAAAAGAACA.CTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 - «► 1020 

CCGCTTTACTTTTCTTCTCATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a CEMKRRLXKEISKFVOSSVT 

b AXCKEO* RKR FQNLL. I L L * P 

c RMEKKTS ERO FK I CC FFCNR- 

CGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGACCTATCACAATCCTGATTC 

1021 1080 

CCTTAATTGTTGTTCTTATAATCGTTCCTTTTTCTTCTTCTCCATAGTCTTAGGACTAAG 

a G ! N N X N I SNE K EEELSQSCF 

b ELTTR t^ATK KKKSYHNPDS 

c n**ooey*o?krrra ITILIL- 



FIGURE 12 (cont.) 



a 
b 
c 



a 
b 



a 
b 

c 



yl a 

r." b 

c 

m 



& a 

.W c 



fll * 
c 



a 

b 
c 



a 
b 
c 



a 
b 
c 



a 
b 
c 



TT AAAG ATTTC AAAAATTCC AG GT AAG AG AG ATA C ATTC ATT AAAATTC AT AT ATT AT AG 

10 81 — - — 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAG 

LKISXIPGKRDTFIK I H I L * 

• RFQKFQVREIHSLXFIYYS 
KDFKNSR ' ERYIH-NSYI IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAA 

1141 * * * — * 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

FFISQLLFSFILTIFFD'LE 
FSFHSCYFLLS * Q Y F L I S W K - 
FHFTAVI FFY L N N I FCLAGS- 

GTAAAAAGTATCAAATAAGJVGAAGCGCTAGACTGAGGTAACTTAG<riTATTCACATTCAT 
12 01 ♦ * * 1260 

VKSIK'EKR * T E V T * L IH IH 

• KVSNKRSARLR * LSLFTF I - 
KKYQIR EALDCGNLAYSHS • - 

AGATCGACCrrCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 ♦ - ♦ 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTT^ 

RSTFIYPIRC'GNSSHPF'K 
DRPSYIQYODKETAV I RFKN 
IDLHISNTM1RKQQSSVLXI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAA^ 

1321 * - 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTA 

CYED-IFRVKKWSRNLNQK 
SAHRTXFLESRNGAE I L I K X - 
VLCGLWF * SQEHEPKS * SKR- 

GAATTGCGTCCATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 - - 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

ELRHYCKR I EL* I FR * * V L P 
NCVDIAXESNSXSFVNXYYQ 
IASILQKNRTLNLSL I 3 I T N - 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 ■ - 1500 

TAGAACTAACTXACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

I LI0CRO - RCNCTEDH - R N K 
SCLIEEI DEATAQK I I X £ I X 
LDCLXRLTRQLHRRSLXK " S - 

GT AACTTTT ATT AATT AG AG AAT AAACT AAATT ACT AAT AT AG A G AT C ACCG ATCTTC AA 

1501 - 1560 

C ATTG AAAATAJ^TTAATCTCTT ATTTG ATTT AATG ATTAT A TCTCT AGTCGCT AG AAGTT 

VTFIN'RIN-ITNIEISOLQ 

• LLLIRE * T X L L I R S A I F N - 
NFY'LENXLNY • Y R D Q R S S I - 

TTGACG AAAT AAAAGCTG AACTAAACTT AG AC AAT AAAAAAT AC AAAC CTTGCTCAAAAT 

1561 - - 1620 

AACTGCTTrATTTTCGACTTGATTTCAATCTC 

L T K * X L N * S • TIXNTNLCQN 
CRNKSCTKVRQ* K IQTLVK I - 
DEIXAELKLDNKKYK P W S X Y - 

ATTGAGG AAGGAAAAG AAG ACC AGTTAGC AAAAG AAAAAAT AAGGC AAT AAAT AAAATG A 

1621 1680 

TAACTCCTTCCTTTTCTTCTCGTCAATCGTTTTCTTTTTTATTCCCTTATTTATTTTACT 

I EECXEOQ LA KEK IRQ- i K C 
LRKZXKT S Q K X K G N X N E 

CGRXRRPVSK RXNXA INKHS- 
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GT AC AC AAGTG AAG AAATAAAAG ATTTATTTTTTTC AATAATTTATTGAAAAG ACGGGTT 
1681 - ♦ ♦ + ♦ + 1740 

C ATCTC TTC ACTTCTTTATTTTCT AAAT AAAAAAAGTTATTAAATAACTTTTCTCCCC AA 

a VQKCRNKRFIFFNNLLKRCV 

b YRSEEIKDLFFSI IYCKECF- 

C TEVKK'KIYFFO" FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 1762 

AACCCCAAAACCCCAAAACCCC 

a LCrWGFG 
b WGrGVLG 
c G V L G F W 
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2 EVDV3N0ADNHG I HS ALKTC E EI KEAKTLY SW I QKV IRCRNQSQSHY KDL 51 
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52 EDIX'FAQTNIVATPRDYNEEDFKVIARKEVF . STGLMIELIDKCLVELL 100 
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6 3 DERRYI ITKALL EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF . 107 

101 SSSDVSOROKLOCFGFOLKGNQLAKTHLLTALSTQKOYFFODEWNOVRAM 150 

1 0 8 CWHKNTQPFI EKYFNKAVLLPNDLLEVCEFAQVLY I 14 4 

I 5 1 IGNELFRHLYTKYLI FQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 200 

::. : .|| | ..|.... ::: :: 

145 FDA7EFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNI PRCC STC K YNVKNEKDHFLNNINVPNWNNMKSRTR I FYCTHF 247 

::J! : . | . . . : : | . . | . . | : . | . : 
182 SEFNEYQL.GKYCTES . . QRKKTMFR YLSVTNKQKWDQTKKK 220 
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FIGURE 18 
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FIGURE 19 



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
6 1 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
1 2 1 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcaclt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
66 1 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca-aaggacacac 
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggncc gttcgtactl gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1 80 1 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 
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FIGURE 20 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLAVYIIWELYIRTTTN 
YIVAFC WHKNTQPFIEK YFNKAVLLPNDLLEVCEFAQVLYIFDATEFKMLYLDRILS 
QDIRKJELTFRKCLQRCVRSKJ^SEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKV^TK 
KKIIKENLLTTCLQAIKESEDKSKJRET 

APKIPNSTLES KYLTFKJDLI KFC HI S EPKERVYKILGKKYPKTEEEYKAAFGDS AS AP 
FNPELAGKRNOQEISKTWENELSAKGhrTAEVWDNLISSNQLPYMAMLRNLSNILKAGV 
SDTTHSIVINKICEPKAVENSKMFPLQFFSAIEAVN^^ 
AVKJEVVEKTDEEKJCDMELEQTEEGEFVKVNEGIGKQYI^ 

HTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQ 
CNKCYLEVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHVDNIVILSD 
MMIAEG YS DINVRGS SIVNS IKK YKDEVNPNIKIFA VDLEG YGKCLNLGDEFNENNYT 
KIFGMSDSILKFISAKQGGANMVEVIKNFALQKIGQK 



FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
1 8 1 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
24 1 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
48 1 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa atttttlgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 actttlgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
174 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtatactttt aagtagaacg aatmaan taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagcttag aagatanga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattm caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
234 1 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
252 1 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 



FIGURE 22 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 

EEDLKLLKFKMQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 
DLNLNEDENKJCNGLSEQQVKJEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 
ETDYDTEKWFEISHDQKKYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAE 

FYAFDDFSQTIKLTNNSYQTVNIDVNFDNNI£I 

QYNFEKJGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 
STDLKLVDTNKVQDYFKPLQEFPRLTHVSQQAIPVSATNAVEh^NVLLKXVKHANLNL 
fc VSIPTQFNFDFYFVNLQHLKJLEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 
! VAQETSRKQILKQATTIKNLKNNKNQEETPETKJDETPSESTSGMKFFDHL^ 
= FSVNLQATQEIYDSLHKLLIRSTNLKJCFKI^YKYEME 

i RCSVNISNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYT7KQNEFQFNNVKSAKIE 

3 
I 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 
NLEhTVSINCILDQHlLNSISEFLEKNKJCIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 
j LNQVYINQQLEELTVSEVHKQV\^NHKQKAFYEPLCEFIK£SSQTLQLIDFDQNTVSD 
1 DSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 
LCLPTGTYYDYNSDRW 



FIGURE 23 

MKILFEFIQDKLDIDLQTNSTYKJENLKCGHFNGLDEILTTCFAL 

PNSRKJALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYK1ARNEDVNNSLFCHS 

ANVmrTLLKGAAWKMFHSLVGTYAFVDLLINYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLR 

EAlFPTNLVKJPQRLKVPvWLTLQKLLKRHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKFIIVILQKLLPQEMFGSKKNKGKJIKM^NLLLSLPLNGYLPFDSLLKKL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VWRHDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMPJIPKKSNNEFR 

UAIPCRGADEEEFnYKJENHKNAIQPTQKJLEYLRNKRPTSFTKTYSPTQlADRJKE 

FKQRLLKKFhfhA/LPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

TNTG VLKLFNWN AS R VPKP Y ELYIDNVRTVHLSNQDVIN VVEMEIFKTALWVEDKC Y 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLnSTDQQQV 

rNIKKLAMGGFQKYNAKANRDKJLAVSSQSDDDTVIQFCA^OTFVKELEVWKHSSTMN 

NFHIRSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSrNVTQNMQFHSFLQRIIEMTVSGCPlTKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKJDNIILLRKEIQHLQAYIYIYIHIVN 



FIGURE 24 

Oxytricha LCVSY1LSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

AXFLHWI^SVYVVEIXRSPFYVTETTFQKNR 
1 8E I EWLVXX5KR6 NAXMCLSDFEKRKQ I FAEFI YWL Ytf SFI I P ILQSFF Y I TESS DLRNR 

LKDFRWLFISD IWFTKHNFEHLNQLAICFISHLPRQLIPKIIQTPFYCTBIS5TVT- 

TRE I SVWQVET - S AXH P Y YFDHEN - I Y VLWKLLRW I PEDLWSLr I RCFF Y VTEQQXS YS X 

+ -* * * # * 

• • • • » • • 

Motif 1 

IJ^FYRKSVWSKLQSIGlRQHLKRVQLRDVSEAEVRQHREARPALLTSRLRPIPKP — POL 
TVYFRKDlWXLIXRPFI«TSMKMEAFEKIHEHNVRMDTQK-TTliPPAVIRl^PKK~-KTP 
IVYFRHDTWNKLITPFIVEYFKTYLVEKNVCRWHKSYTLS — RFNBSKMRI IPKK5MMEF 
TY YYRKNlWDVIMKMS I - ADI*XXETLA£VQEXEVBEWKXS -I/GFAPGXL,RL,I PKK — TTF 
..*.♦•** .* .** 

Hot if 2 

RP I VNHDYWGARTFRREXRAERLTSRVXALF -8 VLNYERA 

RLITN-LRKRFLIKMGSHKKMLV6TNQTLRPVA8ILKHL,INEESSGIPFHLEVYMKLLTF 
RI I A I PCRGADEEEFT I YX EHHXHAI QPTQKI I*E YLRNXRPTS FTX I YS PTQ I ADR I KEF 
RPlMTFHKKIVNSDRKTTKLTTHTKtXKSHLMUCTLKH-RMFKDPPGPAVPKYDDVMXKY 



Kotif 3 (A) 

KKDLLKHRMFGR-KXYFVRIDIKSCYDRIKQDLKFRIVKR-ICLKDPEPVIRKYATIHATS 
KQRlrLrKKFNNVLPELYFMKFDVKSCYOSlPRMECMRILKD-ALKNENGFFVRSQYFF^TN 
EBFVCKWKQVGCPKLFPATMDIEXCYDSVNREXLSTFLKTTXLLSSDFWIMTAQILKRXM 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIGURE 27 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 



FIGURE 28 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTT ATGTC ACGGAG ACC ACGTTTCAAAAG AAC AGGCTC mTI CT ACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 
GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 
AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 
GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 
CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 
GTGCTC A ACTACG AGCGGGCGCG 



FIGURE 29 



MTEHHTPKSR1LRFLENQYVYLCTLNDYVQLVLRGSPASSYSN1CERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLE1IGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LIMAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPYIDTHDDEK1LS 

YSLKPNQVFAFLRS1LVRVFPKLIWGNQRJFEIILKDLETFLKLSRYESFSLHYLMSNJKISEIEWLVL 

GKRSNAKMCLSDFEKJIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKINEh^RMDTQKTTLPPAVIRLLPKKNTTRLITNLRKRFLIKMGSNKKMLVSTNQT 

LRPVASILKJHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVFRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKKFLNLSLRGFEKIWFSTSLEKTVINFENSNGIIWTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFID1THNSKFNSCCNJ 

YRLGYSMCMRAQAYLKJIMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 



FIGURE 30 



ggtaccgamacmccmcncataagctaattgcttcctcgaacgctcxtaaatctctggaaatamttacaagaactcaataacaataccaagtcaaattccaatatgaa^ 

tgnattagtgatcgamtamctattttatcggtcgttaccaagtataaggacaaaaag 

gg ttcgcttactmaatcgtggtactgtmagctgctacttctaga*accgcgtg^^^ 

ctgatgagactatattagancattacagtccgtgcatattcttaacatggagccttacacmagatgagtcacgtcgcatgatggagtemggtatcatccaacgm 

aaaaggttgataattatttgcaaaatcatgtccta^ 
ctccmaacgcggttttattmctatmctattctcatgpgttra^ 

aataatctaaanagracgcnataangatagtag^agaaagattggtgattctactcgtgtaatgttattagtttaaagatacmgcaaaa 

aaaatcctataanataaaunaatcaatamgcggtcactatttatttaaaacgttatgatcagtaggacacmgcatatatatagttatgcttaatggttac 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

CCTTAAATGATTATGTACAACTTGTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTAGTCGGCTTCGACAGT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGg ^tatat attmgttttgatttttttctattcg 

ggatagctaatalatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaattgtgaaatamacctgcaattactgmcaaagaga ngtama accgataaag^ 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccg gttaagatg ngcgcacmgaaca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aactaatactgnatccncataacmtmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAA TATTA AAAGg 

tattgtataaaattuttaccactaacgattttaccagACCTCGAAACTTTCT^ 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaattaacaatcagATTTCAGAAATTGAATGGCTA 

ccttggaaaaaggtcaaatgcgaaaatgtgcttaagtgattttgaga aacgcaag caaatatttgcgg 

AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATCTTTTTTTTATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtattttaaagtattttttgcaaaaagctaatattttcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtattaatttttggtcatcaatgtacmacttctaatctato 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaamtgcagGCGTAAGAAG 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 

AAAAAGAAACTCAAGGATCCCGAATTT GTAAT TCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaag^ 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATTTTGT 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaattgttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagttctaaccgttgaagGATTTGAGAA 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATACTTTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 

ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGG AA ATC T \ ' 1 1111 1 A CAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacttattttaactagaaaagtcattaattaaccttagATc till GAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 



« 4 
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ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcatmcaamanatatacatcctt 

tattactggtgtcttaaacaatattattactM^ 

atccnatacttttaagaaagaagacagtggttgctgactectgcccacatgcccattaaacgggagtggttaaacattaaaagtaatacatga^ 
aamggaaagtggtmctataatgaaiaatgcccgcactaatgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatat 

acccagtgttgttgaagaaagcaaggammggaacaagcttctgcagatgacaggcmatm^ 
gccngctactgagacgaaaagaaactaaggatagmgaatactaatagctcatttaatgtcttatataaggtmgtttmcctgac^^ 

gtgtmgccattattggattccgaaatagccaaamcttggttcctcaaagc^ 

gaatcttccaccgatgaggaaatggatagcnatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcangggagacaictcngatgaatcagatgcgga 
gagtatctccagcggatccngatgtcaataacnctatttctgaaatgtatggtcctactgtcgcttcgacnctcgtagctctacgcagnaagtgaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTEISST V$XVYFRHDT WN KLIT ? FIVE YFK-TYLVEN 

FFYVTEQQKS YSKTYYYRKN IWDVI-MKMS IAD LKK ETLA— EVQE 

KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQFSEK NKQMKNNFYQ 

FFY.TE. . K. .S..YYYRK, IW_.-KL F. .K V. . 



40 
43 
44 

50 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



NVCRKHNSY 

KEVEEWKKSL 

KIQLEEENLE KVEEKLIPED SFQKYPQGK$8li?l_ 

k...e f. .die;icEiEEk 



TLSNFWH^^lPlj^a^NNE ^M|KlPCRG 
' — PRE J yfTFNKK 



FFEITTFLRK 



- lESHlMTF.RK 



79 
78 
92 

100 



t!*EST2 pep 
yEuplotes pep 
ClTrans of tetrahymen 

¥ Consensus 



\ JJEST2 pep 
-^Euplotes pep 
UlTrans of tetrahymen 

* Consensus 



ADEESFTIYK ENHKNAIQP^VqkilE^D 
rVNSDRKTTK LTTNTKLL^HLMLia EkN 
DKQKNIK V^J^QlLm^il^r'^^ 



.K. .K LN.N- .L. .S~QL.L 



tpNK RPTSFTKIYS FTQIADRIKE 

RHFK -DPFGFAVFN 

ML-G -QXIGYSVFD 

.IG. .YF'. 



fkqrl: 



.K- 



NVL 

EFVCKWKQVG d| 
•A QFIEKWXNKG fS 



■frFMKFD VKSCYD 
?FATMD IEKCYD 
g{fYVTL- 



.HKF. .F..KWK..G MJWF.T.D . . .CYD 



129 
120 
130 

150 



157 
155 
158 

186 



• % 
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S-l: FFY VTE TTF QKN RLF FYR KSV WSK 
S-2: RQH LKR VQL RDV SEA EVR QHR EA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q . 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YA^V 
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A. 



B2 



Vector Genomic DNA Insert 



A5 



5825 bp Sequenced 
-2 kb Hind III Fragment 




Vector 



\ kb 



tezJ + 



RT Motifs 1 2 3(A) 4(B*) 5(C) 6(D) 
1/ \ \ 1 



mm m*\ mB s omiiililili 



Introns 



2 3 



cDNA 
5-20 



cDNA 




6 7 8 9 10 11 




Hind III Xca 

Original PCR 
3' RT-PCR 



RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15Bdnc 
RT-PCR w/ M2-B15 Bene 
RT-PCR w/ M2-B16 Bcfnc 



Hind III 



500 bp 
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Poly 4 

t t c 

ta a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B') 



5(C) 

D D Y L L I T 

3'- ctg ctg atg gag gag tag tgg -5' 

a a aaaaa a a 

t t t t 

c c 

Poly i 
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Motif B' (4) Motif C (5) 

QTKGIPQG DDYLLIT 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



ot 

Ea_pl23 

Sp_M2 

Sc_pl03 



LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
KGIPQGLCVSSILSSFYYATLEESSLGFI.RDESMNPEKrPNVNLI.MRLTDDYLI.IT 

SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 

it * * • * * • 



Q "ff 
caa!!^aa 


V 

gtt 


G 

ggt 


I 

ate 


P 

cct 


Q 

cag 


G 

gg. . 






< 


•Actual Genomic Sequence. 


































Pol^| 4 










































t 
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t d*jf a 


g 


c 


c 


t 


c g 




























ca#%cc 

t n 


aaa 


gga 


att 


cca 


taa 


gg - 


> 






















ag acc 
t<Otgg 


aaa 
ttt 


gga 
cct 


att 
taa 


cca 
ggt 


tea 
agt 


ggC 
ccG 


TCA 
AGT 


ATT 
TAA 


CTG 
GAC 


TCA 
AGT 


TCT 
AGA 


TTT 
AAA 


TTG 
AAC 


TGT 
ACA 


CAT 
GTA 


TTC 
AAG 


TAT 
ATA 


ATG 
TAC 




K 


G 


I 


P 


S 


G 


S 


I 


L 


S 


S 


F 


L 


C 


H 


F 


Y 


M 


fa 

GAA GAT 
CTT CTA 


TTG 
AAC 


ATT 
TAA 


GAT 
CTA 


GAA 
CTT 


TAG 
ATG 


CTA 
GAT 


TCG 
AGC 


TTT 
AAA 


ACG 
TGC 


AAA 
TTT 


AAG 
TTC 


AAA 
TTT 


GGA 
CCT 


TCA 
AGT 


GTG 
CAC 


TTG 
AAC 


TTA 
AAT 


CGA 
GCT 


E D 


L 


I 


D 


E 


Y 


L 


S 


F 


T 


K 


K 


K 


G 


S 


V 


L 


L 


R 



GTA GTC gac gac tac etc etc ate acc 
CAT CAG ctg ctg atg gag gag tag tgg 

V V D D Y L L I T 

< — ctg ctg atg gag gag tag tgg 
a a aaaaa a a 
t t t t 
c c 

Poly i 



gac gat ttc etc ttt ata aca 

D D F L F I T 



< Actual Genomic Sequence. 
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3' RT PCR Strategy 



mRNA 



5' 



IAAAAAAAAAAAAAAAA 
T TTTTTTTTTTTTTT 




1 . Synthesis of cONA with Q| Primer. 
j«* mRNA 



IAAAAAAAAAAAAAAAA. 
■TTTTTTTTTTTTTTT ■ 



2:|first Round PCR Using Outside Primer and Qq Primer. 



m 3* 



[ T TTTTTTTTTTTTTT I 



^.Second Round PCR Using inside Primer and Q| Primer. 



HI 
Q 
I'll 



4. Sequence Second Round PCR Products Using inside Primer or Q| Primer. 
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-Size Selected Libraries from P. Nurese 
3 - 4 kb 
5 - 6 kb 
7-8kb 
11 - 12 kb 

-Libraries from J.A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 





GAD (Gal Activation Domain) Library 




REP Library from R. Allshire 
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Hind III Digested Positive Genomic Clones 
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5* RT PCR Strategy 



mRNA 



I AAAAAAAAAAAAAAAA . 



ddA- 



]. Synthesis of cDNA with Specific Downstream Primer. 

mRNA 



5* 



I AAAAAAAAAAAAAAAA . 



feligate Oligo with 5'-P and blocked 3* to cDNA using T4 RNA Ligase. 



; 5= 



|JL first Round PCR 



4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429) . WLYNSFIIPILQSFFYITESSDLRNRT\nirFRKDIW . . . (35) . . . 

S.C Est2p (366). WLFRQLI PKI I QTFFYCTEI SSTVT- IVYFRHDTW ...(35)... 

E.a. pl23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

* * * * * * * * * 

Motif 1 Motif 2 K 
p hh h K hR h R 
S.p. Tezlp AVTRL-LPKK- -NTFRLITN- LRKRF ...(61)... 
S.C. Est2p SKMRI I PKKSNNEFRIIAIPCRGAD ...(62)... 
E.a. pl23 GKLRLIPKK— TTFRPIMTFNKKIV ...(61)... 

* * * * * * * 

Motif 3(A) AF 

h hDh GY h 
S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFR IVK ...(89)... 
S.C. Est2p ELYFMKFDVKSCYDSIPRMECMRILK . . . (75) . . . 
E.a. pi 2 3 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 

* * *** * 

Motif 4(B') 

hPQG pP hh h 
S.p. Tezlp YLQKVG I PQGS I LS SFLCHFYMEDL I DEYLS F ...(6)... 
S.C . Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF . . . (8) . . - 
E.a. pl23 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)... 



y Y Motif 5(C) Motif 6(D) 

111 h F DDhhh Gh h cK h 

S.p. Tezlp VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS .(205) 

S.c. Es t2p LI LKLADDFLI I STDQ0^VINIKKLAMGGFQKY1JAKA.NRDKILAVSSQS .(173) 

E.a. pl23 LLMRLTDDYLL I TTQENNAVLF I EKLINVS RENGF KFMMKKLQTSF PL S . (209) 



* * * * 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or uro4 marker. 



M 



telomerase gene::M 
wild type telomerase gene 




wild type telomerase gene 



Wild Type (X2) 



2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 




Senescence? (X2) 



(These cells will show a senescence phenofype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 
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Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1 m T EHHTPKSRI LRFLENQYV 20 

o 

1019QTAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
2l;jT L CTLNDYVQLVLRGS PASS 40 

107^||PAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
41j|f SNICERLRSDVQTSFSIFL 60 

1139 f? CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198 
6Jf«p S TVVGFDSKPDEGVQFSSP 80 
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1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 
81 K C S Q S £ B6 

1273 CTA ATAGCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 
87L IANVVKQMFDES FERRRNL 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtat tctaattgtgaaatat ttacctgcaattactgt ttcaaagaga 1405 
107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 
114 NHEDFRAMHVNGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129 D LVSTFPNYLISI LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 
149|Ji L L E I I G 155 

1602^GT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 
156jl| D AMHYLLSKGSI FEALPND 175 

1662'JaAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 
17 6;S|j Y LQISGIPL FKNNVFEETV 195 



1722^CA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

196* S K KRKRTI ET S I T QNKSARK 215 

1782^feAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

216hb V.SWNSISISRFSIFYRSSY 235 

HI 

1842Q&AG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaattttag AT CTA TAT TTT AAC 

i907f|| 

236 K K F K Q D L Y F N 245 

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 L H S I CDRNTV HMW LQWI F P R 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 QFG LIN AFQVKQLHKVI PLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 S Q STVVPKRL LKVYPLI EQT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 A K RLHR I S L S KV Y NHYC PY I 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

326 D T HDDEKILS YS LKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

346 F L RSILVRVFPKLIWGNQRI 365 

2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattt taccag AC CTC GAA ACT 2336 

366 F E I I L K D LET 375 



% % 
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2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 F LKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 24 65 

396 ~ I SEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 KRSNAKMCLSDF EKRKQI FA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFI I PILQSFFY 445 



2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2645 

446 ITESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

46J> L LCRPFITSMKMEAFEKINE 485 

21Q% gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

4$| NNVRMDTQKT 495 

pi 

27f| ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 

4§M TL PPAVIRLLPKKNTFRL.it 515 

Hi 

■ > 

28^6 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtat taatttttggtcatcaatgtactttacttctaatc tatta 2906 

5%§ N LRKRFLIK 524 

; £ 

29$ ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

MGSNKKMLVSTNQTLRPV 542 

Co 

29&8 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

5ii ASILKHLINEESSGI PFNLE 562 

3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLL.TFK KDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRI DI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 K S CYDR IKQDLMFRIVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 

612 K DPEFVIRKYATIHATSDRA 631 



3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 T K NFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SD TLFVDFVDYWTKSSSE I F 679 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 

680 K M LKEHLSGHIVK 692 



% % 
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3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 IGNSQYLQKVGI P Q G S 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 

709 I L S S F LCH F Y M EDL I D EY L S 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 FTKKKGSVLLRVVDDFLFI T 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtca ttCC 3777 

749 V'NKKDAKXFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 I NF^ENSNGI INNTFFNESK K 798 

La 

390 i=5 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 

79KR MPFFGFSVNMRSLDTLLAC 818 

■ m 

396I f :CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

8lf"p K IDEALFNSTSVELTKHMG 838 

402|;fAAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 

83¥ f K SFFYKILR S 848 

409pAGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

84^ S LASFAQVFIDITHNSKFNS 868 

41SSITGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209 

86® C C NIYRLGYSMCMRAQAYLK 888 

m 

4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274 

889 R MKDIFIPQRMFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA JGGA TAT ACG .AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 

918 E I LGYTSRRF LSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 

947 PSFKYHPCFEQLIYQFQSLT 966 

4529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 

967 D LIKPLRPVLRQVLFLHRRI 986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatat tat tactaagtata 4665 
987 ad* 989 
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4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 4745 

4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 

4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 

4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 

5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 

5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 

5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcctcaaaaactcc 5305 

5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 

5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 

5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 




rii 



% 
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i 

met ser val tyr val val glu leu leu 
G CC AAGTTCCTGC ACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 



10 

arg ser phe phe tyr 
AGG TCT TTC TTT TAT 



leu phe phe tyr arg 
CTC TTT TTC TAC CGG 



val thr glu thr thr 
GTC ACG GAG ACC ACG 

30 

lys ser val trp ser 
AAG AGT GTC TGG AGC 



20 

phe gin lys asn arg 
TTT CAA AAG AAC AGG 



lys leu gin ser ile 
AAG TTG CAA AGC ATT 



40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 



S 60 

'if glu a la glu val arg gin his 

il GAA GCA GAG GTC AGG CAG CAT 

'ft thr ser arg leu arg phe ile 
yl ACG TCC AGA CTC CGC TTC ATC 

O 90 

H ile val asn met asp tyr val 

H ATT GTG AAC ATG GAC TAC GTC 

SI 

Q ioo 

m glu lys ala glu arg leu 

GAA AAG ARG GCC GAG CGT CTC 

■ 120 

ser val leu asn tyr glu arg 
AGC GTG CTC AAC TAC GAG CGG 

130 

ala ser val leu gly leu asp 
GCC TCT GTG CTG GGC CTG GAC 



arg glu ala arg pro ala leu leu 
CGG GAA GCC AGG CCC GCC CTG CTG 

80 

pro lys pro asp gly leu arg pro 
CCC AAG CCT GAC GGG CTG CGG CCG 



val gly ala arg thr phe arg arg 
GTG GGA GCC AGA ACG TTC CGC AGA 

110 

thr ser arg val lys ala leu phe 
ACC TCG AGG GTG AAG GCA CTG TTC 



ala arg arg pro gly leu leu gly 
GCG CGG CGC CCC GGC CTC CTG GGC 

140 

asp ile his arg ala trp arg thr 
GAT ATC CAC AGG GCC TGG CGC ACC 



phe val leu arg val 
TTC GTG CTG CGT GTG 

160 

phe val lys val asp 
TTT GTC AAG GTG GAT 

asp arg leu thr glu 
GAC AGG CTC ACG GAG 



150 

arg ala gin asp pro 
CGG GCC CAG GAC CCG 



val thr gly ala tyr 
GTG ACG GGC GCG TAC 

180 

val ile ala ser ile 
GTC ATC GCC AGC ATC 



pro pro glu leu tyr 
CCG CCT GAG CTG TAC 

170 

asp thr ile pro gin 
GAC ACC ATC CCC CAG 



ile lys pro gin asn 
ATC AAA CCC CAG AAC 
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190 

thr tyr cys val arg 
ACG TAC TGC GTG CGT 



gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gin gly ile pro gin 
CAG GGG ATC CCG CAG 



leu cys tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 

GTG GTG AAC TTC CCT 
310 

phe val gin met pro. 

TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



arg tyr ala val val 
CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 

GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gin lys ala ala met 
CAG AAG GCC GCC ATG 



ser tyr val gin cys 
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT" CGG 

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 

CTG GGT GGC ACG GCT 
320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

4 00 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 

CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG .GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 

CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep pl23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhhK hRh R 

Ep p 1 23 . . . KS LGF APGKLRLI PKKT— TFRPIMTFNKKI V . . . 

Sp Tezl ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL... 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD... 

Hs TCP 1 ...ARPALLTSRLRFIPKPD-GLRPIVNMDYVVG... 
consensus R PK R I 

AF 

Motif A h hDh GY h 

Eppl23 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRJVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP 1 ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 
consensus F D YD 

Motif B hPQG pS hh 

Ep pl23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C hFDDhhh 

Ep p 1 23 ...PNVNLLMRLTDD YLLITTQENN... 

SpTezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

HsTCPl ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D GhhcK 

E PP 123 ...NVSRENGFKFNMKKL... 

SpTezl ...LNLSLRGFEKHNFST... 

ScEst2 ...KKLAMGGFQKYNAKA... 

HsTCPl . . . LRTLVRG VPE YGC V V . . . 
consensus G 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCG ATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCC ACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCC AGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
30 1 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
35 1 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
40 1 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
50 1 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
55 1 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
60 1 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
75 1 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
1 00 1 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGC A CTTCCTCTAC 
1 05 1 TCCTCAGGCG AC AAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
1151 TTCC AGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCC A 
1 20 1 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTC AAG A CGCACTGCCC GCTGCGAGCT 
1 30 1 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1401 CTGCTCCGCC AGC AC AGC AG CCCCTGGC AG GTGTACGGCT TCGTGCGGGC 
1 45 1 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTC AGG AAC ACCAAGAAGT TC ATCTCCCT GGGG AAGC AT 
1 55 1 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1601 CGCTTGGCTG CGC AGG AGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
1 701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1 80 1 TGCAAAGCAT TGGAATCAGA C AGC ACTTGA AGAGGGTGCA GCTGCGGGAG 
1 85 1 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1 90 1 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCG ATTG 
1 95 1 TGAACATGGA CTACGTCGTG GGAGCCAGA A CGTTCCGCAG AGAAAAGAGG 
2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTG AGCTGTA CTTTGTC AAG GTGGATGTGA CGGGCGCGTA 
220 1 CG AC ACCATC CCCC AGGAC A GGCTC ACGGA GGTC ATCGCC AGC ATC ATCA 
225 1 AACCCCAGAA C ACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
GCTCCATCCT CTCCACGCTG CTCTGC AGCC TGTGCTACGG CGACATGGAG 
AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
GGCTTTTGTT CAGATGCCGG CCC ACGGCCT ATTCCCCTGG TGCGGCCTGC 
TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
ATCTACAAGA TCCTCCTGCT GC AGGCGTAC AGGTTTCACG CATGTGTGCT 
GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT G AAAGCCAAG 
AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
CTCCCC ACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
GCCCTGCCCT CCTTTGCCTT CCACCCCCAC C ATCCAGGTG GAGACCCTG A 
GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 



FIGURE 51 



+- „ + + + + + 

aiTOX^ACGCAGGAO^CGOGT^ 



A A I, R P A A H V G £ F C P C H P R U A - 
OKCVLLRTWEALAPATPAMP- 
SAASCCARGKPWFRPPPRCR- 



+ 4 |. ) V 

CGCGCGAGGGGCGACGGCT^^ 



ARSPl>PSRALP A A Q P L V R G A 
R A PKCRAVRSLLRSHY R E V Tj - 
A L P A A E P C A P C C A A T T A R C C - 

GCCGCTCGCXACGT^^ 

121 + * * * 180 

CTGCGACCGCIGGftAG^^ 

A A G H V R A A P G A P G I. A A G A A R 
P L A T F V R R L G P Q G W R L V Q R G - 
RWPRSC6AWGPRACCWCSAC- 

G£AC<]CGGGGGCTTICCGaOT 
181 * + — + + ♦ ?40 

CCTCGGCCGCOTAVVG^ 

G P G G F P F ? G G P ? f G V R A L G ? 
I? P A A V R A ? V A ? C ? V C V p W ? ? 
TKRLSARWWP7AWCACF G ? ? - 

24i .- — -—-I— - — — + 300 

? A APPRPLLPPCVLPE V ? G 6 - 
? ? p P A * A P S FRQVSCL? ? L V A - 
S?PPPPPPSARCPA*??MMP- 

CC&AGKXTOCANMIS^^ 

301 + + 1 4 + — + 360 

gcctxiacgacgtnw^^^ 

psaa?avr?rre ? fi a c l r l r 
p. v l ? ? t. c ? r g a ? n v l a f c f a - 

£ C C V V CA?AAR?TCWPSASR- 

GCTGCTOGAiWGCXsOCC^ 
361 * * * * i— ♦ 420 

AAGRG r R G PPRGLHHQRAQL 
LLOGARGG F PEAFTTSVRSV 



FIGURE 51 (cont) 



CCTCCCCAAttOG^^ 

421 : . ♦ - *» 

CCAGGOOTCTXC^^ 

PAQHGDRRTAGEFGVGAAAA 
L P N T V T D A T. RGSGAWGLLLK - 
C p T P- * F T H C C C A G R G G C C C A - 



, _ , _ ^. „ _ — t ■+ 

GceCCACCtXCSVXtt^ 



PRCRRRAGSPAGTLR? ? C A '7 
K V C D D V L V H L L A F C A ? F V L V 

AWATTCWFTCWHAAR ? T. C W W 



GGNTCCCACCTGCGCCTAOT 
_ _________ _ «- ------**•-+-•* - -- -- — + + 

CCNAi^CrTOA^ 



GSQLRLP?VRAAAVPARBC? - 
V P S C A Y ? V C G PPLY&LGAAT- 
paapt?cackRC t _: £ a L V l - 



. ... 4 , — — s 1 * i 

AtfTCOSG<XXBGGG^^ 



scpapatr?wtr?ri»gsngp 

0 A R P P P IIA?GPE?VWOPT G L - 
RPGPRHTT. ?Dr?ASGIQRAW- 



0a*ACCATAGCG_O>J_^ 
CCTTOGTAT03C 



CTIASCRPCSPWAASPGCEE - 
K P * R 0 G G R C P P G L P A PGA R R - 
WHSVR'EAGVPLGC 0 P R V R G G - 



OXXXXXXXGTCACCGTC^ 



ARGQCQPKSAVAQEAQAWRC 
K G C S A S R S L P L P K R P R R G A A - 
A r; A V r A F. V C R C P R C? P G V A L ? - 



GttSACTCGGOCT^^ 



F "AC A D A R W A C V LGFFGQDA 
PEPERTPVG Q GSWAHPCRTP 
T. S RSGRPT. GF^PGPTR A C R L 



% % 

FIGURE 51 (cont.) 



H4] + * + + • + 900 

HTE*PWFLCGVTCQTBRRSH - 



b CPSDRGFCVVSPARPAEEAT 
DRVTVVSVWCHLPDPPKKPP 



c 

CIOTlGGAGGGtt^^ 

901 + * -♦ * * 900 

CAGAJ^(XTOCCACGCGAC^CACCGTV^^ 

jj LFGGCALWHAFLFPXRGrr'A 
ti SLEGALSCTHriSHPSVGROn- 
LWRVRSLARATPTHPWAAST- 

CCACGajGGCDX^ 

S61 + +~- — + 4 * -i 1020 

a pftCPPIHIAATTSWDTFCPP 

f|| b HAG P P S T 5 P P PR PGTR L V .p P. 

; r r| c TRAFHPHRGHHVLGHALS P G - 

* f 

GTCTAOXX^aACOtf^^ 

1021 + * + ♦ -■. + .. * 1U»U 

f«* CACATGCGGCTCTtTCrT^~C^ 

H : a VYAETKHFLYSSCDK?TA?L - 

111 b CTPRPSTSSTPGAT?TLRP*- 

f*t V k k D Q A L P L L L R P. Q ? H C ? P P - 

' M CTTTCTAttOATC^^ 

1081 + * — + — + U-io 

GAAQGATCAGTTATAT.VJACICCGGC^ 

a L P T Q Y I * G P A * L A F G R F V £ T 

b F L L N £ i EAQPDWRSGGSWR e'- 

er SYSlYLRPSLTGVREVRGD?- 

1141 + + * + ♦ t 1200 

a ? F L vpGLGCQO 5 P Q v a P P A P - 

h S P W F Q A L. D A P. I P R R L P R L P Q - 

f: L £ G ? R P W M P G F P A G C P A C P S - 

X2ni * ♦ * -+ i,?60 

OXNATGMZCGTTTAO^CCCXTO^ 

a A L A M A A ? V S G A A W E P R A V p - 

b R x W Q MR FLFLELLGNH A Q C p - 

C ? T G K C G p C F W S C L G T T K £ A r - 



FIGURE 51 (cont.) 



fil. 



1261 * + +- * 131:0 

a LRGVFQDAbHAASCGHKSSR - 

b y G V F L K T H C P L R A A V T P A A G - 

XGCSSRRTARCEtRSrQQPV- 



c 



c 



3 



5 



„„ . _ ■♦. — — — ■*- 

ArA0ACACO3CXrCTCTTCag 



; . C L C P G F. A P G I. C G G P R G G G T 0 

b VCAKKKPQCSVAAPEEEKHR- 
C V P G R S PRAT. WRPrp R P N T D - 



ACOXCGTCGCCTGCT^ 

+ + + + + h 

TGGGGGCAJGQ3GA30CAGGTOGACGAGGCQG 



TFVAWCSCSASTAAPGRCTA 
b ppSPGAAAPPAQQPt AGVRL- 

l*RRLVQLLRQMSS PWQVYGF- 



% UA1 ♦ ♦ 1500 

AGCAO50CCGGAC^5AO3CG<^ 

U! 0 S C G PACAGWCPQASGAFGTT - 

>£ . b H a GLPAPAGAPRPL G T. Q A Q R - 

O c v K k C h Vi. ^ h V ? ? (Z L V* G S F. n z - 



AACGCXXJCl'TCCTtlAGGAAC^ 

, ». ± - - — + _ — - — - ♦ 

TOXTGGCGAAGGAGTCCTT^^ 



NAASSGTPRSSSPWGEMPS 
b TP I. PQEHQGVHLPGEACtfAL 

c 



R R FL R-NTK K y I SLGK H A K L S - 
CGCiraGGAGCTCAl^^ 



1561 



•SCGAOGTCCTOCACTOCAOCTICTAC^^ 

RCRS*RGR * A C 0 T A L C C A G A 



- + 16-20 



AAGADVEOERAGLRLAAQEP 
t: L 0 E L T W K M S V K D C A W L R ft S P - 

cAGcwnGGcicixnrc 
1+21 v + + + leso 

iTICCO-aACCGAC^C^ 

h 0 0 L A V FkPQSTVCVRRs W T 5 - 

b p G W L C S C R K A r S A * G D P G C> v - 

c G V G C V p A A £ H R r.. P E E I L A K - 



FIGURE 51 (cont.) 



TCrCTGCACTGOTll^^ 
l^yX I ♦ - 1^40 

A S CTG**VCTSSGCSCLSFMS 

b FAI. ADECVKRRAAQVFLLCH- 

9 LHWf, HSVTVVELLRSFFYVT* 

1741 + - - — ♦ * 1800 

RRPRPKRTGSFST6RVSGAS - 
b GDHVSKEQALFLPBECLBQV- 

ETTFOKHHLPPYRK $ V W s K L - 



~~* "* + -+ 1 i. 

ACGTTIOrTAACCTTAGTCTG^^ 



1% " C K A r. ES05T*XCCSCGSCRK * 

15 b A KHWNOTALEECAAACAVGs?- 

0 S T G T R Q K L K R V Q t. R J? h S E A - 

1 . ,._ 

i f h + : + v«i 

<%XCCAC3CCC1t^ 



QRSGS£GKPGPPC*RFD5AS 
K C Q A A S C S 0 A R P A D V O T F L H 
£ V r q H R E A ft P A L L T S R L R F I 



TCCCOAC^CTV^ 

+ + - • -+ . + ♦ 

*>gg<htog<s\^^ 



31 S Pi* L T G " C G R L * T W T T S W E P £ 

b - P 0 A * R A A A D C E H C L R R g S O N 

PKPDGLRPIVNMD7 V V c A K T 

OffnraCACAGAAAAG*^^ 
1S81 # , , + _ _ + ;?fj4n 

GCAAGGaTICICI^ J«?102C 

* S A E K R G P £ V £ P K G * P. H C *3 A 
h v PO RK E GKAS H L EG E G T v p r 

F R * E K R A E R L T S R V K A L F 5 V - 

•rcCTOACTACCA^ 

, on ♦ * _ t + _ + 21Q0 

* C STTSGRGAPASWA P L C W A W - 
6 A « L * A G A A PRPPGRC. C A G P G - 
- L K V E R AKRPCLLG A S v l G I. D - 





FIGURE 51 (cont.) 



AO:,ATATCCAO»GGGCCTGG^^ 



2101 



TISTCPGAPSCCVCGFRTRR 
U i F- Q G L A H L R A A C AG P G PA. A - 
DIHRAWRTFVLRVR A 0 D P P P - 



ClCAciClOTACTTTGTCAAC^^ 

2161 - +- + + + < 2220 

C^CTOGACATGAAJ^^^ 

T. SCTLSRWM*RARTTPnFRT 
♦ A.VLCQGGCDORVRHHPPGQ 
ELYFVKVDVTCAYDT i P Q "l3 K - 

GCCiX^OGGA0<n^TCGCC^G» 

2221 ♦ * + + )• 2260 

CCGAGTOJtnCCAG^^ 

•3 S R R S S P A S S N P R T P T A C V t? 
AHGGHRQHHQTP EHVLRASV- 

LTEVIASI ikpqntycvrry- 

ATttXX7n3GTCCAGAACCC£^ 

:-?2fil <■ + + * + + 2340 

TAO>^CCAGGTCTrC^^ 

MPWSRRPPMGTSARP5RATS 

CRGP£CRPWARPOGLOEPRL- 
AVVQKAAHGHVRKAFKSHVS 

CTACCrn^CAGAWJXX^XXXTTACAl^ 

2341 + * + * * * ."MOO 

GATCGAiOGTCTOGAGGTOGQCAT^ 

L P * 0 T 5; S R T C D S 5 W L T C R > T 
Y L D R PpAVHATVRGS P A G ? Q - 
T F.. T D L 0 P Y M A O f V A H L Q ? N S - 

2401 + + + + — + 2460 

A R * CS M P S S S 3 R A P P * M R P A V 
PAEGCRRHRAELLPE'GQ 0 W - 
P L R D A V V I EQ5SSLNEASSG- 

GCCItTKXACCTCT 

24Cl + + + i »• * 2520 

ASSTSSYASC A T T P C A S C A S 
PLRRLPTLHVPPRRAKQGOV- 
LFOVfr'LRFMCHHAVRTRGKS- 



FIGURE 51 (cont.) 



ccrAorrcc^G™^ 

+ + r + + 25$'-? 

GGATWX^'JCGG^^ 



p T 5 5 A R G 55 R P ft P S S P R C S A A 
L R P V P G DPAGLHTT. KAALQP- 
Y V 0C0CIPQGS1LSTLLC S T.. - 

>5K1 * ♦ * * + ' * ** 2640 

C&TATWkTSCLRCFGGTCCS 
v L R R H G EQAVCGD5AGFAAP- 
C Y G D M F. W K L f A G I RFlDGLLL- 

— 1 *- + ♦ + 2700 

CVVJMMISCW'HLTSP'fKKPS 
A F G G * FT. VGDTS rHTRENLP- 
R t. V D D F L L V T P H L T H A K T h m L - 

TCAGGAaJCttXaCCC^^ 

^701 — r + * + + * 2750 

ACTTQTTraGACC*/^ 

5CFWSEVSLSMA h W * T C G R 0 - 
QDPGPRCP * V W I. R G E L A F. - O S - 

F T L V R G V P E Y G C V V N L R K T V - 



ACCACTTGA-VX*?^ 



W * T 5 L * KTK PWVAKtiliFRCK 
GELFCRRRCPCWHCFCSD A G - 
V N F P V . E D E A h G G T A F V £ M F A - 

COlAO/JC^ 

2821 + * + + + 1 2680 

GGGTOGCSGATMOS^^ 

p T A Y $ P C AACCWIPGPWRCR - 
p R ? I ? L V R P A A G Y P D P G G A E - 
„H^t,PPWCGLLLCTRTt,5VQ55- 

GGGACTACTQCAtXT&TCCXDO^ 

2£iffl ♦ * + * * 2940 

CGCTGATCAGGira*:!^^ 

ATTPAMFGPPSEPVSPSTAA 
RLLQLCPDLH Q ? Q S II L 0 P R L - 

PYSSYARTS I R A S I. T F N R G P - 



FIGURE 51 (cont.) 



2541 + 1 + + + r 5000 



b 



5> F: L G 5 T C V A N S L G S C 2 * 3 V T 
Q - W £ £ H A SQTLWCLAAEVS 0 



K A 



RNMRRKT. F G V T. R T. K C H 



3001 1- f + 1- + r 30** 

CGGACAtAGACCTAAACGTCCACITCyiCOGA 



■3 

b 



A C F W I C R * T A S K K C A F T 5 T K 
l» v $ C F A G E 0 P P D G V H Q H L Q D - 
LPLDLOVNSLQTVCTNT V K T - 



2061 + + + - •+ - ■ + ♦ 31. :0 

AWAGGACGACGTCOXLATOJC^^ 









b 








C 






; 55 
Ml 




pa 








pi j 


















b 




c 














; 2 








fff 


a 




b 







S S C C R R T G F T H V C C S £ H V 1 S 
P F A A G V 0 V S R M C A A A P I S S A - 
L I* I. QAYRFHACVLOLFFH O O - 

AAGTTIXXWWSAACO^ 

3121 -+ * * 1- + . 2120 

TTCAAACCHCITO^^ 

K F G R T P M F S C A $ S L T I* P PSA 
SLEEPHIFPARHI, *HGLPl, i. 
VWKNPTFFLP. VISDY ASLCY 

ACTCCATOraAM^ 

3181 + * + + + :i24H 

TGAGGTAGGACmttX?TK^ 

TPS'KFRTQGCRHGPRAFPA 
LHFF. SQERRDVACGQGRRRP 
SILK A K N A CMS L C A K C A A C P - 



3241 * - ♦ * - ?.:-:oo 

GAa*03GGAGGCir;a^^ 

5 LCPPRPCSGCATKH5C £ S * T, - 

b S A J. R G R A V A V P P S IP A Q A D S 

c LPSCAVQWLCHQAFLLKLTP. - 



?..^ni * «- -t + +- • 4 - lifin 



h DTVSFTCHSWGKPG Q F R P S * 

l» TPCHLRATPGVTQDSP D A A E 

C H R V T i V FLLGSLRT A Q T Q T. f. 



% % 

FIGURE 51 (cont.) 

* * + + + \i20 

V 0 F S R G R R *T. PWRPQFTRHC 
SEAPGDDADCPGGRSQP G T r. - 

rklfgttltaleaaan e a l ? - 

CCTCAGACnCiJ>.CL*CO 

f + i + * - uisn 

OGAGTCTGMGTI^ 

PQTSR PSWTOGHPPTAFPRA 
LRLQDHPGLM A T P. P Q P G F. F. £ - 

S D F K T I L D • W F F A H S? g A f S H - 

« * - + * » 3540 

D T SSPVTPGKTSQGGRGGPH 
O b TPAAL5RRALKFKECGAAHT- 

Q . C H Q Q P C H A G L Y V P G R EG P. P T P - 

■*J2 ixac^ccgcaccgc^ 

$ 3541 * ♦ f * 3600 

' ■ 

Iff rj PtfPHP. WE$EA*VSVWFKFAC 

^ b OAPTAGSLRPK*VPGRGLH V - 

c kPAPLGV*GLSECt*AEACMS- 

1*7 a^GCTCAAGGClx^ 

P.* 3(501 *■ * t + + 3*£n 

GGOCGAC ITLU^JCKJICAOC^ 

p A E G * V S G * G L S E C P A K C * V 
R L K A £ C PA £ A * A 5 V Q P R A E C - 
G • R L V R L R P E R V S 5 Q G L S V - 

TCCA*yjACACCTC<:03TCT 

IHitl + - * ► + • 3710 

AGGTCfflGTOGAOGGCAC^Jt^^ 

5 STPAVFTSPOAGARI-HPR A 
b P A H L P 5 S L< PHRLA-LGST P C P - 

QHTCRLHF P T G W R $ A P P Q G 0 - 

AGCriTIVjCTCACCA<XA' 

» £ FSSPGARLPLPT* E • 5 T P R 

b AFPHQEFGFHSPHRNSPSFtt- 

C T. r L T K S P A S* T P H I »5 I V H F Q I - 



% * 

FIGURE 51 (cont) 



tio:x;caTOjITCa^ 

3761 + * - -* »• • 3640 

AAOCX3GTfcACA4GTOGGCAflG3GGA^ 

j F A I v n p s P C P P L F S T P T I 0 v 

b sslftprpallclp P P P S R w - 

c P nCTSPLALPSFAFHPHH P - * 

•» K . n - + * . - + - - - !■ 3-1-00 

t:TCT^V,ACTCTTCC^ 

- FTLKRTLG A L v? T W S D Q R C A L 

}, K p * E 0 P W E I. W E F G V T K <i V P '1* - 

C D ? E K D r G 2 3 G N L £ * P K V C P V - 



liOu) «■ .--+ + 4- - - * 39£0 

2 ATClXTTCXGeiCCTTJXACW^ 

Ml a yTGEDPAPGWGST,WVKLGGC 

«|, b '!' O A P. T L H L *> G G I* C C S N W G E V - 

flj c HRRCPCTWMCVPVOQIGGKC 

3 of.! + + * + 1 » 4020 

ccACAoocTCATimaxy^^ 



a A V .3 V K Y * IYEFFSFEKK K K K 

b LWE'WTEYMSFSVLKKKKK K 

c: CGSKTLNI*VPQF * K K K K K K - 

AAAAAAAAA 

4021 4029 

T7TTTTTTT 

i K K K 

1> K K 

C K K 



FIGURE 52 



BSPH1 6951 
APAL1 6783, 



BSPH1 5943 



APAL1 5537 



in- 



SAC 2 4817 
81' NOT1 4 807 

O BAMHl 4 

f|| SMA1 478 
ECOR1 4 771 
N0T1 4763 
SAC 2 4 752 
NOT! 4748 



DRA3 22 5 . 

XHOl 674 
:eCOR5 7 01 
ECOR1 7 07 
NOT1 713 
MSC1 852 

SAC 2 1102 



SMA1 1435 
SMA1 1555 




SMA1 2 061 



MSC1 2403 



XHOl 2746 
EC OR 5 2 334 



SM&1 4102 

APAL1 3 772 

SAC 2 3 664' 



DRA3 3227 
BAMHl 3270 



% % 



FIGURE 53 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 , 60 

phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

11 ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 

U GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

HI 

9 80 90 

1X1 leu lys glu leu val ala arg val leu gin arg leu cys glu arg 

CTG AAG GAG CTG- GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala iys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG A^C GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 



FIGURE 53 (cont.) 



140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gin val cys gly pro 
GCC TAC CAG GTG TGC GGG CCG 



thr gin ala arg pro pro pro 
ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 
CTG GGA TGC GAA CGG GCC TGG 



val pro leu gly leu pro ala 
GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG 

260 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 



150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 

280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 
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2S0 

gly ala leu ser gly thr arg 
GGT GCG CTG TCT GGC ACG CGC 



gin his his ala gly pro pro 
CAG CAC CAC GCG GGC CCC CCA 

320 

crp asp thr pro cys pro pro 
TGG GAC ACG CCT TGT CCC CCG 



leu tyr ser ser gly asp lys 
CTC TAC TCC TCA GGC GAC AAG 

350 

leu ser ser leu arg pro ser 
CTC AGC TCT CTG AGG CCC AGC 



glu thr iie phe leu gly ser 
GAG ACC ATC TTT CTG GGT TCC 

380 

arg arg leu pro arg leu pro 
CGC AGG TTG CCC CGC CTG CCC 

leu phe leu glu leu leu gly 
CTG TTT CTG GAG CTG CTT GGG 

410 

val leu ieu lys thr his cys 
GTG CTC CTC AAG ACG CAC TGC 



ala ala gly val cys ala arg 
GCA GCC GGT GTC TGT GCC CGG 



300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyr ala glu thr lys his phe 
GTG TAC GCC GAG ACC AAG CAC TTC 

340 

glu gin leu arg pro ser phe leu 
GAG CAG CTG CGG CCC TCC TTC CTA 

360 

leu thr gly ala arg arg leu val 
CTG ACT GGC GOT CGG AGG CTC GTG 

370 

arg pro trp met pro gly thr pro 
AGG CCC TGG ATG CCA GGG ACT CCC 

390 

gin arg tyr trp gin met arg pro 
CAG CGC TAC TGG CAA ATG CGG CCC 

400 

asn his ala gin cys pro tyr gly 
AAC CAC GCG CAG TGC CCC TAC GGG 

420 

pro leu arg ala ala val thr pro 
CCG CTG CGA GCT GCG GTC ACC CCA 

430 

glu lys pro gin gly ser val ala 
GAG AAG CCC CAG GGC TCT GTG GCG 
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440 450 
aia pre glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

l^u arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 m 480 

a la cvs leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arcr arg phe leu arg asn thr 1-ys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gin glu leu thr trp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

ala lys one leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 • 570 

leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile gly ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
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ser glu ala glu val arg gin 
TCG GAA GCA GAG GTC AGG CAG 

620 

leu thr ser arg leu arg phe 
CTG ACG TCC AGA CTC CGC TTC 



pro ile val asn met asp tyr 
GCG ATT GTG AAC ATG GAC TAC 



610 

his arg glu ala arg pro ala leu 
CAT CGG GAA GCC AGG CCC GCC CTG 

630 

ile pro lys pro asp gly leu arg 
ATC CCC AAG CCT GAC GGG CTG CGG 

640 

val val gly ala arg thr phe arg 
GTC GTG GGA GCC AGA ACG TTC CGC 



650 

arg glu lys arg ala glu arg 
AGA GAA AAG AGG GCC GAG CGT 



phe ser val leu asn tyr glu 
TTC AGC GTG CTC AAC TAC GAG 

680 

gly ala ser val leu gly leu 
GGC GCC TCT GTG CTG GGC CTG 



thr phe val leu arg val arg 
ACC TTC GTG CTG CGT GTG CGG 



660 

leu thr ser arg val lys ala leu 
CTC ACC TCG AGG GTG AAG GCA CTG 

670 

arg ala arg arg pro gly leu leu 
CGG GCG CGG CGC CCC GGC CTC CTG 

690 

asp asp ile his arg ala trp arg 
GAC GAT ATC CAC AGG GCC TGG CGC 

700 

ala gin asp pro pro pro glu leu 
GCC CAG GAC CCG CCG CCT GAG CTG 



710 

tyr phe val lys val asp val 
TAC TTT GTC AAG GTG GAT GTG 



gin asp arg leu thr glu val 
CAG GAC AGG CTC ACG GAG GTC 

740 

asn thr tyr cys val arg arg 
AAC ACG TAC TGC GTG CGT CGG 



his gly his val arg lys ala 
CAT GGG CAC GTC CGC AAG GCC 



720 

thr gly ala tyr asp thr ile pro 
ACG GGC GCG TAC GAC ACC ATC CCC 

730 

ile ala ser ile ile lys pro gin 
ATC GCC AGC ATC ATC AAA CCC CAG 

750 

tyr ala val val gin lys ala ala 
TAT GCC GTG GTC CAG AAG GCC GCC 

760 

phe lys ser his val ser thr leu 
TTC AAG AGC CAC GTC TCT ACC TTG 
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770 

thr asp leu gin pro tyr met 
ACA GAC CTC CAG CCG TAC ATG 



glu thr ser pro leu arg asp 
GAG ACC AGC CCG CTG AGG GAT 

S00 

ser leu asn glu ala ser ser 
TCC CTG AAT GAG GCC AGC AGT 



phe met cys his his ala val 
TTC ATG TGC CAC CAC GCC GTG 

830 

gin cys gin gly ile pro gin 
CAG TGC CAG GGG ATC CCG CAG 



cys ser leu cys tyr gly asp 
TGC AGC CTG TGC TAC GGC GAC 

860 

ile arg arg asp gly leu leu 
ATT CGG CGG GAC GGG CTG CTC 



leu val thr pro his leu thr 
TTG GTG ACA CCT CAC CTC ACC 

890 

leu val arg gly val pro glu 

CTG GTC CGA GGT GTC CCT GAG 



lys thr val val asn phe pro 
AAG ACA GTG GTG AAC TTC CCT 



3 (cont.) 

780 

arg gin phe val ala his leu gin 
CGA CAG TTC GTG GCT CAC CTG CAG 

790 

ala val val ile glu gin ser ser 
GCC GTC GTC ATC GAG CAG AGC TCC 

810 

gly leu phe asp val phe leu arg 
GGC CTC TTC GAC GTC TTC CTA CGC 

820 

arg ile arg gly lys ser tyr val 
CGC ATC AGG GGC AAG TCC TAC GTC 

840 

gly ser ile leu ser thr leu leu 
GGC TCC ATC CTC TCC ACG CTG CTC 

850 

met glu asn lys leu phe ala gly 
ATG GAG AAC AAG CTG TTT GCG GGG 

870 

leu arg leu val asp asp phe leu 
CTG CGT TTG GTG GAT GAT TTC TTG 

880 

his ala lys thr phe leu arg thr 
CAC GCG AAA ACC TTC CTC AGG ACC 

900 

tyr gly cys val val asn leu arg 

TAT GGC TGC GTG GTG AAC TTG CGG 
910 

val glu asp glu ala leu gly gly 
GTA GAA GAC GAG GCC CTG GGT GGC 



920 930 
thr ala phe val gin met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 
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gly leu leu ieu asp thr arg 
GGC CTG CTG CTG GAT ACC CGG 

950 

ser ser ~yr ala arg thr ser 
TCC AGC TAT GCC CGG ACC TCC 



arg gly phe lys ala gly arg 
CGC GGC TTC AAG GCT GGG AGG 

980 

val ieu arg leu lys cys his 
GTC TTG CGG CTG AAG TGT CAC 



asn ser leu gin thr val cys 
AAC AGC CTC CAG ACG GTG TGC 

1010 

leu gin ala tyr arg phe his 

CTG CAG GCG TAC AGG TTT CAC 



his gin gin val trp lys asn 
CAT CAG CAA GTT TGG AAG AAC 

1040 

ser asp zhr ala ser leu cys 
TCT GAC ACG GCC TCC CTC TGC 



ala gly met ser leu gly ala 
GCA GGC- ATG TCG CTG GGG GCC 

1070 

ser glu ala val gin trp leu 
TCC GAG GCC GTG CAG TGG CTG 



940 

thr leu glu val gin ser asp tyr 
ACC CTG GAG GTG CAG AGC GAC TAC 

960 

ile arg ala ser val thr phe asn 
ATC AGA GCC AGT GTC ACC TTC AAC 

970 

asn met arg arg lys leu phe gly 
AAC ATG CGT CGC AAA CTC TTT GGG 

990 

ser leu phe leu asp leu gin val 
AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

thr asn ile tyr lys ile leu leu 

ACC AAC ATC TAC AAG ATC CTC CTG 

1020 

ala cys val ieu gin leu pro phe 

GCA TGT GTG CTG CAG CTC CCA TTT 
1030 

pro thr phe phe ieu arg val ile 
CCC ACA TTT TTC CTG CGC GTC ATC 

1050 

tyr ser ile leu lys ala lys asn 
TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

lys gly ala ala gly pro leu pro 
AAG GGC GCC GCC GGC CCT CTG CCC 

1080 

cys his gin ala phe leu leu lys 
TGC CAC CAA GCA TTC CTG CTC AAG 



1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1100 mo 

arg thr a la gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG AC? GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAC-CAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAC^CGAGGACCCTGCACCTGGATGGC^-GTCCCTGTGC^oTCAAATTGGGGGGAGGTGC 

TGTGGGAC-TAAAATACTGA^TATATGAGTTTTTCAGTTTTGRAAAAAAAAV^AAAAAAAA 

AAAAAAAA&A 
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